Decomposing Swedish Compounds Using Memory-Based Learning

Author

  • Karin Friberg
Abstract

Swedish morphology differs from English in several significant ways, so natural language processing methods developed for English are not always applicable to Swedish material. One such difference is compounding, which is a very productive word-formation process in Swedish. Compounds are mostly written as a single word, with no marking of the segmentation point, so they must be segmented before they can be interpreted. In this study I implemented a decomposer that finds the segmentation point in Swedish compounds, making compounds easier to handle in natural language processing. Brodda’s algorithm for heuristic compound segmentation guided the work. The decomposer is implemented in TiMBL, a memory-based learner.
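To illustrate the general idea of memory-based segmentation, here is a minimal sketch in Python. It is not the decomposer described in the abstract and it does not call TiMBL. The character-window features, the window size, the unweighted overlap distance, and the five training words are all assumptions chosen for illustration. Every candidate boundary inside a word is stored as a labelled instance, and a new boundary receives the majority label of its nearest stored instances.

```python
from collections import Counter

PAD = "_"  # padding symbol for windows that run past the word edges (assumption)


def windows(word, left=3, right=3):
    """Yield (position, features) for every internal boundary of the word.

    The features are the `left` characters before the boundary and the
    `right` characters after it. This feature design is hypothetical.
    """
    padded = PAD * left + word + PAD * right
    for i in range(1, len(word)):
        yield i, tuple(padded[i:i + left + right])


def build_memory(examples, left=3, right=3):
    """Store every boundary of every training word as a labelled instance."""
    memory = []
    for word, split in examples:
        for i, feats in windows(word, left, right):
            memory.append((feats, "+" if i == split else "-"))
    return memory


def overlap_distance(a, b):
    """Number of feature positions where two instances differ."""
    return sum(x != y for x, y in zip(a, b))


def classify(feats, memory):
    """Return the majority label among the nearest stored instances."""
    best = min(overlap_distance(feats, m) for m, _ in memory)
    votes = Counter(
        label for m, label in memory if overlap_distance(feats, m) == best
    )
    return votes.most_common(1)[0][0]


def decompose(word, memory, left=3, right=3):
    """Return the boundary positions classified as segmentation points."""
    return [
        i for i, feats in windows(word, left, right)
        if classify(feats, memory) == "+"
    ]


# Toy training set (illustrative only): (compound, index of the split).
EXAMPLES = [
    ("bokhylla", 3),  # bok + hylla
    ("fotboll", 3),   # fot + boll
    ("solsken", 3),   # sol + sken
    ("husvagn", 3),   # hus + vagn
    ("sjukhus", 4),   # sjuk + hus
]
MEMORY = build_memory(EXAMPLES)

print(decompose("fotboll", MEMORY))  # a stored word is recovered exactly: [3]
```

With so few stored instances, predictions for unseen words are unreliable. A realistic setup would need a large annotated word list and, most likely, feature weighting, which this sketch omits.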




Publication date: 2007